RAW SEQUENCE LISTING 
ERROR REPORT 



The Biotechnology Systems Branch of the Scientific and Technical Information 
Center (STIC) detected errors when processing the following computer readable 
form: 



Application Serial Nximber: 
Source: 

Date Processed by STIC: 




THE ATTACHED PRINTOUT EXPLAINS DETECTED ERRORS. 

PLEASE FORWARD THIS INFORMATION TO THE APPLICANT BY EITHER: 

1) INCLUDING A COPY OF THIS PRINTOUT IN YOUR NEXT COMMUNICATION TO THE 
APPLICANT, WITH A NOTICE TO COMPLY or, 

2) TELEPHONING APPLICANT AND FAXING A COPY OF THIS PRINTOUT, WITH A 
NOTICE TO COMPLY 

FOR CRF SUBMISSION QUESTIONS, PLEASE CONTACT MARK SPENCER, 703-308-4212. 

FOR SEQUENCE RULES INTERPRETATION, PLEASE CONTACT ROBERT WAX, 703-308-4216. 
PATENTIN 2.1 e-mail help: patin21help@uspto>gov or phone 703-306-4119 (R. Wax) 
PATENTIN 3.0 e-mail help: patin3help@uspto.gOV or phone 703-306-4119 (R. Wax) 

TO REDUCE ERRORED SEQUENCE LISTINGS, PLEASE USE THE CHECKER 
VERSION 3.1 PROGRAM , ACCESSIBLE THROUGH THE U.S. PATENT AND 
TRADEMARK OFFICE WEBSITE. SEE BELOW FOR ADDRESS: 
http://vyww.uspto.gov/web/offices/pac/checker 

Applicants submitting genetic sequence information electronically on diskette or CD-Rom should be aware that there is 

a possibility that the disk/CD-Rom may have been affected by treatment giyen to all incoming mail. 

Please consider using alternate methods of submission for the disk/CD-Rom or replacement disk/CD-Rqrn, 

Any reply incJading a sequence listing in electronic form should NOT be sent to the 2023 1 zip code add&ess for the 

United States Patent and Trademark Office, and instead should be sent via the following to the indicated addresses: 

1. EFS-Bio (<http://www.uspto.gov/ebc/efs/downloads/documents.htm> , EFS Submission 
User Manual - ePAVE) 

2. U.S. Postal Service: U.S. Patent and Trademark Office, Box Sequence, P.O. Box 2327, Arlington, VA 22202 

3. Hand Carry directly to: 

U.S. Patent and Trademark Office, Technology Center 1600, Reception Area, 7*** Floor, Examiner Name, 
Sequence Inforn^tion, Crystal Mall One, 1911 South Clark Street, Arlington, VA 22202 

U.S. Patent and Trademark Office, Box Sequence, Customer Window, Lobby, Room 1B03, Crystal Plaza Two, 
201 1 Soutii Clark Place, Arlington, VA 22202 

4. Federal Express, .United Parcel Service, or other delivery service to: U.S. Patent and Trademark Office, 
Box Sequence, Room 1B03-Maib:oom, Crystal Plaza Two, 201 1 South Clark Place, Arlington, VA 22202 




BIOTECHNOLOGY 
SYSTEMS 
BRANCH 




Revised 01/29/2002 



Raw Sequence Lbtlne Error Summary 



KRROR DETECTED 



SUGGESTED CORRECTION 



SERIAL NUMBER: 



ATTN: NEW RULES CASES: FLEASE DISREGARD ENGLISH "ALPILi" HEADERS, WHICH ^RE INSERTEPSpV PTO SOFTWARE 

1 Wrapped Nucleic* The numberAcxl at Iho end of each line "Vrappcd^ down to the next line. This nuy occur if your file 

Wrapped Aminos was retrieved in a word processor afUr aeating it Please adjust your right margin to .3; this will 
prevent •^wapping.•• 



2 Invalid Line Length The lulcs require that a line not ciceed 72 characters in length. This includes white spaces. 

3 Misaligned Amino 
Numbering 



4 Non- ASCII 

Variable Lenglh 



The numbering under each 5* amino acid is misaligned. Do not use lab codes between numi 
use space characters, instead. 

The submitted file was not saved in ASCn(DOS) text, as required by the Sequence Rules. Please 
ensure your subsequent subnUssIon Is saved In ASCII test. 

3 

Sequcnce(s) contain n*s or Xaa*s representing more than one residue. Per Sequence Rules, 

each n or Xaa can only represent a single residue. Please present the maximum number of each 
residue having variable lenglh and indicate in the <220>-<223> seetioMhal some may be missing. 

A "bug** in Patenlln version 2.0 has caused the <220>-<223> section to be missing from amino acid 

sequcnces(s) - Normally, Palentin would automatically generate this section from the 

previously coded nucleic acid sequence. Please manually copy the relevant <220>-<223> section to 
the subsequent anuno acid sequence. TMs applies to the mandatory <220>-<223> sections for 
Artificial or Unknown sequences.' 

_SJdppcd Sequences Sequcncc($) nussing. If intentional, please insert the following lines for each skipped sequence: 

(OLD RULES) (2) INFORMATION FOR SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 

(i) SEQUENCE CHARACTERISTICS: (Do not insert any subheadings under this heading) 

(xi) SEQUENCE DESCRIPTION:SEQ ID NO:X: (insert SEQ ID NO where "X" is shown) 
This sequence is intentionally skipped 



Patcntin 2.0 
"bug" 



_Skippcd Sequences 
(NEW RU.LES) 



Please also adjust the "(ii) NUMBER OF SEQUENCES:" response to Include the skipped sequences. 



Sequence(s) ^ 



missing. If Intentional, please insert the following lines for each skipped sequence. 



<2 10> sequence id number 
<A00> sequence id numt>er 
000 



11 



12 



Use of n*s or Xaa's Use of n*s and/or Xaa's have been detected in the Sequence Listing. 



(NEW RULES) 



10 Invalid <213> 

Response 



Use of<220> 



Patentin 2.0 
" "bug** 



Pa 1.823 of Sequence Rules, use of <220>-<223> is MANDATORY if n*s or Xaa's are present 

In <220> to <223> section, please explain location of n or Xaa, and which residue n or Xa« represents. 

Per 1.K3 of Sequence {(blcs^ the only valid <213> responses are: Unknown, Artificial Sequence; or 
scientific lume (Genus/species). <220>-<223> section is required when <213> response is Unknown or 
^is Artificial Sequence • 



Sequence(s) _ 



missing the <220> ''Feature'* and associated numeric identifiers and responses. 



Use of <220> to <223> is MANDATORY if <213> 'X)rganism- response is "Artificial Sequence" or 
-Unknown.- Please explain source of genetic material in <220> to <223> section. 
(See **Fedcnd Register,- 06/01/1998. VoL 63, No, 104. pp. 2963 1-32) (Sec 1.823 of Sequence Rules) 

Please do not use "X^opy to Disk" function of Patentin version 2.0. This causes a corrupted file, 
resuhing in miuing mandatory numeric identifiers and responses (as indicated on raw sequence 
listing). Instead, please use -File Maiuger- or any other manual means to copy file to floppy disk. 



13 Misuse of n n can only be used to represent a single nodeotide in a nucleic acid sequence. N ii no< used to represent 

any vahie not specifically a oucleodde. 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/784 , 553A 



DATE: 01/17/2003 
TIME: 11:03:34 



Input Set : A:\2459-l-003 Seqlist.txt 
Output Set: N:\CRF4\01172003\l784553A.raw 



3 <110> APPLICANT: Zhou, Ming-Ming 

4 Aggarwal, Aneel 

5 Verdin, Eric 

6 Ott, Melanie 

8 <120> TITLE OF INVENTION: Methods of Identi 
10 <130> FILE REFERENCE: 24 59- 1-003CIP 
12 <140> CURRENT APPLICATION NUMBER: 09/784,55 
C~> 13 <141> CURRENT FILING DATE: 2002-12-30 

15 <150> PRIOR APPLICATION NUMBER: 09/510,314 

16 <151> PRIOR FILING DATE: 2000-02-22 
18 <160> NUMBER OF SEQ ID NOS : 59 

20 <170> SOFTWARE: Patent In version 3.0 

22 <210> SEQ ID NO: 1 

23 <211> LENGTH: 3014 

24 <212> TYPE: DNA 

25 <213> ORGANISM: Homo sapiens 

27 <400> SEQUENCE: 1 

28 ggggccgcgt cgacgcggaa aagaggccgt ggggggcctc 

29 gaggctggca gccgccggca cgcacaccta gtccgcagtc 

30 cagggcgcgg agcagagtcc cgggcaggag aaccaaggga 

31 gcggcagcgg cagcggagcc gctagtcccc tccctcctgg 

32 ccgccgccgc caccaccatc agcgcgcggg gcccggccag 

33 cgctaggggg agggcggggg cggggagggg ggtgggcgaa 

34 ggagggtctc gctctcccga ctaccagagc ccgagggaga 

35 cctgacactc ggcgcctcct gccgtgctcc ggggcggcat 

36 ggccgggcgg ctgcggggca ggagccgggg caggggccgg 

37 agcctgcggc gcttccgccc gcgcccccgc agggctcccc 

38 gctcgggcgc ctgcggtccg gcgacggcag tggctgcagc 

39 gaggcggtgg ctcggcccga atcgccgtga agaaagcgca 

40 ccaagaaact ggagaaactc ggagtgtact ccgcctgcaa 

41 gtaatggctg gaaaaaccct aacccctcac ccactccccc 

42 taattgtcag tctaacagaa tcctgtcgga gttgtagcca 

43 cccacctgga gaatgtgtca gaggaagaaa tgaacagact 
4 4 tggaatatct ctttacctgt gtccacaagg aagaagatgc 
45 tctatctatt taagctcttg agaaagtcta ttttacaaag 
4 6 gctctttgga aaagaaaccc ccatttgaaa aacctagcat 

47 ttgtgcagta caaatttagt cacctgccag caaaagaaag 

48 caaaaatgtt cctaaaccgc atcaactatt ggcatctgga 
4 9 tgcgatctcc caatgatgat atttctggat acaaagagaa 

50 actgcaacgt gccacagttc tgcgacagtc tacctcggta 

51 ggagaacatt gcttcgctcg gtcttcactg ttatgaggcg 

52 gacaggaaaa agataaactg cctcttgaaa aacgaactct 

53 aatttctgtc catgctagaa gaagaagtat atagtcaaaa 



fying Modulators of Bromodomains 
3A 



ccagcgctgg 
ccgaggaaca 
gggcgtgtgc 
gggagcagct 
agcgagccgg 
gggggcggga 
ccctggcggc 
gtccgaggct 
gcccggggcg 
ctgcgccgct 
gggcacggcc 
actacgctcc 
ggccgaggag 
cagagccgac 
tgccctagct 
cctgggaata 
agataccaaa 
aggaaaacct 
tgaacagggt 
gcaaacaata 
ggcaccatct 
ctacacaagg 
cgaaaccaca 
acaactcctg 
aatcctcact 
ctctcccatc 



cagacaccgt 
tgtccgcagc 
tgtggcggcg 
gccgccgctg 
gcgagcggcg 
gggcgtgggg 
ggcggcggcg 
ggcggggccg 
ctgcccccgc 
gccgccgggg 
gaaggaccgg 
gctccgcggg 
tcttgtaaat 
ctgcagcaaa 
gctcatgttt 
gtattggatg 
caagtttatt 
gtggttgaag 
gtgaataact 
gttgagttgg 
caacgaagac 
tggctgtgtt 
caggtgtttg 
gaacaagcaa 
catttcccaa 
tgggatcagg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/784 , 553A 



DATE: 01/17/2003 
TIME: 11:03:34 



Input Set : A:\2459-l-003 Seqlist.txt 
Output Set: N:\CRF4\01172003\l784553A.raw 



54 attttctctc agcctcttcc agaaccagcc agctaggcat ccaaacagtt atcaatccac 1620 

55 ctcctgtggc tgggacaatt tcatacaatt caacctcatc ttcccttgag cagccaaacg 1680 

56 cagggagcag cagtcctgcc tgcaaagcct cttctggact tgaggcaaac ccaggagaaa 1740. 

57 agaggaaaat gactgattct catgttctgg aggaggccaa gaaaccccga gttatggggg 1800 

58 atattccgat ggaattaatc aacgaggtta tgtctaccat cacggaccct gcagcaatgc 1860 

59 ttggaccaga gaccaatttt ctgtcagcac actcggccag ggatgaggcg gcaaggttgg 1920 

60 aagagcgcag gggtgtaatt gaatttcacg tggttggcaa ttccctcaac cagaaaccaa 1980 

61 acaagaagat cctgatgtgg ctggttggcc tacagaacgt tttctcccac cagctgcccc 2040 

62 gaatgccaaa agaatacatc acacggctcg tctttgaccc gaaacacaaa acccttgctt 2100 

63 taattaaaga tggccgtgtt attggtggta tctgtttccg tatgttccca tctcaaggat 2160 

64 tcacagagat tgtcttctgt gctgtaacct caaatgagca agtcaagggc tatggaacac 2220 

65 acctgatgaa tcatttgaaa gaatatcaca taaagcatga catcctgaac ttcctcacat 2280 

66 atgcagatga atatgcaatt ggatacttta agaaacaggg tttctccaaa gaaattaaaa 2340 

67 tacctaaaac caaatatgtt ggctatatca aggattatga aggagccact ttaatgggat 2400 

68 gtgagctaaa tccacggatc ccgtacacag aattttctgt catcattaaa aagcagaagg 2460 

69 agataattaa aaaactgatt gaaagaaaac aggcacaaat tcgaaaagtt taccctggac 2520 

70 tttcatgttt taaagatgga gttcgacaga ttcctataga aagcattcct ggaattagag 2580 

71 agacaggctg gaaaccgagt ggaaaagaga aaagtaaaga gcccagagac cctgaccagc 2640 

72 tttacagcac gctcaagagc atcctccagc aggtgaagag ccatcaaagc gcttggccct 2700 

73 tcatggaacc tgtgaagaga acagaagctc caggatatta tgaagttata aggttcccca 2760 

74 tggatctgaa aaccatgagt gaacgcctca agaataggta ctacgtgtct aagaaattat 2820 

75 tcatggcaga cttacagcga gtctttacca attgcaaaga gtacaacgcc gctgagagtg 2880 

76 aatactacaa atgtgccaat atcctggaga aattcttctt cagtaaaatt aaggaagctg 2940 

77 gattaattga caagtgattt tttttccccc tctgcttctt agaaactcac caagcagtgt 3000 

78 gcctaaagca aggt 3014 

80 <210> SEQ ID NO: 2 

81 <211> LENGTH: 832 

82 <212> TYPE: PRT 

83 <213> ORGANISM: Homo sapiens 
85 <400> SEQUENCE: 2 

87 Met Ser Glu Ala Gly Gly Ala Gly Pro Gly Gly Cys Gly Ala Gly Ala 
1 5 



88 



10 



15 



90 Gly Ala Gly Ala Gly Pro Gly Ala Leu Pro Pro Gin Pro Ala Ala Leu 



20 



25 



30 



Pro Pro Ala Pro Pro Gin Gly Ser Pro Cys Ala Ala Ala Ala Gly Gly 

45 



91 
93 

94 35 40 

96 Ser Gly Ala Cys Gly Pro Ala Thr Ala Val Ala Ala Ala Gly Thr Ala 

97 50 55 60 

99 Glu Gly Pro Gly Gly Gly Gly Ser Ala Arg lie Ala Val Lys Lys Ala 

100 65 70 75 80 

102 Gin Leu Arg Ser Ala Pro Arg Ala Lys Lys Leu Glu Lys Leu Gly Val 

103 85 90 95 

105 Tyr Ser Ala Cys Lys Ala Glu Glu Ser Cys Lys Cys Asn Gly Trp Lys 



106 



100 



105 



110 



108 Asn Pro Asn Pro Ser Pro Thr Pro Pro Arg Ala Asp Leu Gin Gin lie 



109 



115 



120 



125 



111 lie Val Ser Leu Thr Glu Ser Cys Arg Ser Cys Ser His Ala Leu Ala 



112 



130 



135 



140 



114 Ala His Val Ser His Leu Glu Asn Val Ser Glu Glu Glu Met Asn Arg 
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RAW SEQUENCE LISTING DATE: 01/17/2003 

PATENT APPLICATION: US/09/784 , 553A TIME: 11:03:34 

Input Set : A:\2459-l-003 Seqlist.txt 
Output Set: N:\CRF4\01172003\l784553A.raw 



115 


145 










150 










155 










160 


117 


Leu 


Leu 


Gly 


He 


Val 


Leu 


Asp 


Val 


Glu 


Tyr 


Leu 


Phe 


Thr 


Cys 


Val 


His 


118 










165 










170 










175 




120 


Lys 


Glu 


Glu 


Asp 


Ala 


Asp 


Thr 


Lys 


Gin 


Val 


Tyr 


Phe 


Tyr 


Leu 


Phe 


Lys 


121 








180 










185 










190 






123 


Leu 


Leu 


Arg 


Lys 


Ser 


He 


Leu 


Gin 


Arg 


Gly 


Lys 


Pro 


Val 


Val 


Glu 


Gly 


124 






195 










200 










205 








126 


Ser 


Leu 


Glu 


Lys 


Lys 


Pro 


Pro 


Phe 


Glu 


Lys 


Pro 


Ser 


He 


Glu 


Gin 


Gly 


127 




210 










215 










220 










129 


Val 


Asn 


Asn 


Phe 


Val 


Gin 


Tyr 


Lys 


Phe 


Ser 


His 


Leu 


Pro 


Ala 


Lys 


Glu 


130 


225 










230 










235 










240 


132 


Arg 


Gin 


Thr 


He 


Val 


Glu 


Leu 


Ala 


Lys 


Met 


Phe 


Leu 


Asn 


Arg 


.He 


Asn 


133 










245 










250 










255 




135 


Tyr 


Trp 


His 


Leu 


Glu 


Ala 


Pro 


Ser 


Gin 


Arg 


Arg 


Leu 


Arg 


Ser 


Pro 


Asn 


136 








260 










265 










270 






138 


Asp 


Asp 


He 


Ser 


Gly 


Tyr 


Lys 


Glu 


Asn 


Tyr 


Thr 


Arg 


Trp 


Leu 


Cys 


Tyr 


139 






275 










280 










285 








141 


Cys 


Asn 


Val 


Pro 


Gin 


Phe 


Cys 


Asp 


Ser 


Leu 


Pro 


Arg 


Tyr 


Glu 


Thr 


Thr 


142 




290 










295 










300 










144 


Gin 


Val 


Phe 


Gly 


Arg 


Thr 


Leu 


Leu 


Arg 


Ser 


Val 


Phe 


Thr 


Val 


Met 


Arg 


145 


305 










310 










315 










320 


147 


Arg 


Gin 


Leu 


Leu 


Glu 


Gin 


Ala 


Arg 


Gin 


Glu 


Lys 


Asp 


Lys 


Leu 


Pro 


Leu 


148 










325 










330 










335 




150 


Glu 


Lys 


Arg 


Thr 


Leu 


He 


Leu 


Thr 


His 


Phe 


Pro 


Lys 


Phe 


Leu 


Ser 


Met 


151 








340 










345 










350 






153 


Leu 


Glu 


Glu 


Glu 


Val 


Tyr 


Ser 


Gin 


Asn 


Ser 


Pro 


He 


Trp 


Asp 


Gin 


Asp 


154 






355 










360 










365 








156 


Phe 


Leu 


Ser 


Ala 


Ser 


Ser 


Arg 


Thr 


Ser 


Gin 


Leu 


Gly 


He 


Gin 


Thr 


Val 


157 




370 










375 










380 










159 


lie 


Asn 


Pro 


Pro 


Pro 


Val 


Ala 


Gly 


Thr 


He 


Ser 


Tyr 


Asn 


Ser 


Thr 


Ser 


160 


385 










390 










395 










400 


162 


Ser 


Ser 


Leu 


Glu 


Gin 


Pro 


Asn 


Ala 


Gly 


Ser 


Ser 


Ser 


Pro 


Ala 


Cys 


Lys 


163 










405 










410 










415 




165 


Ala 


Ser 


Ser 


Gly 


Leu 


Glu 


Ala 


Asn 


Pro 


Gly 


Glu 


Lys 


Arg 


Lys 


Met 


Thr 


166 








420 










425 










430 






168 


Asp 


Ser 


His 


Val 


Leu 


Glu 


Glu 


Ala 


Lys 


Lys 


Pro 


Arg 


Val 


Met 


Gly Asp 


169 






435 










440 










445 








171 


He 


Pro 


Met 


Glu 


Leu 


He 


Asn 


Glu 


Val 


Met 


Ser 


Thr 


He 


Thr 


Asp 


Pro 


172 




450 










455 










460 










174 


Ala 


Ala 


Met 


Leu 


Gly 


Pro 


Glu 


Thr 


Asn 


Phe 


Leu 


Ser 


Ala 


His 


Ser 


Ala 


175 


465 










470 










475 










480 


177 


Arg 


Asp 


Glu 


Ala 


Ala 


Arg 


Leu 


Glu 


Glu 


Arg 


Arg 


Gly 


Val 


He 


Glu 


Phe 


178 










485 










490 










495 




180 


His 


Val 


Val 


Gly 


Asn 


Ser 


Leu 


Asn 


Gin 


Lys 


Pro 


Asn 


Lys 


Lys 


He 


Leu 


181 








500 










505 










510 






183 


Met 


Trp 


Leu 


Val 


Gly 


Leu 


Gin 


Asn 


Val 


Phe 


Ser 


His 


Gin 


Leu 


Pro 


Arg 


184 






515 










520 










525 








186 


Met 


Pro 


Lys 


Glu 


Tyr 


He 


Thr 


Arg 


Leu 


Val 


Phe 


Asp 


Pro 


Lys 


His 


Lys 


187 




530 










535 










540 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/784 , 553A 



DATE: 01/17/2003 
TIME: 11:03:34 



Input Set : A:\2459-l-003 Seqlist.txt 
Output Set: N; \CRF4\01172003\I784553A. raw 



189 Thr Leu Ala Leu He Lys Asp Gly Arg Val He Gly Gly He Cys Phe 

190 545 550 555 560 

192 Arg Met Phe Pro Ser Gin Gly Phe Thr Glu He Val Phe Cys Ala Val 

193 565 570 575 

195 Thr Ser Asn Glu Gin Val Lys Gly Tyr Gly Thr His Leu Met Asn His 

196 580 , 585 590 

198 Leu Lys Glu Tyr His He Lys His Asp He Leu Asn Phe Leu Thr Tyr 

199 595 600 605 

201 Ala Asp Glu Tyr Ala He Gly Tyr Phe Lys Lys Gin Gly Phe Ser Lys 

202 610 615 *620 

204 Glu He Lys He Pro Lys Thr Lys Tyr Val Gly Tyr He Lys Asp Tyr 

205 625 630 635 640 

207 Glu Gly Ala Thr Leu Met Gly Cys Glu Leu Asn Pro Arg He Pro Tyr 

208 645 650 655 

210 Thr Glu Phe Ser Val He He Lys Lys Gin Lys Glu He He Lys Lys 

211 660 665 670 

213 Leu He Glu Arg Lys Gin Ala Gin He Arg Lys Val Tyr Pro Gly Leu 

214 675 680 685 

216 Ser Cys Phe Lys Asp Gly Val Arg Gin He Pro He Glu Ser He Pro 

217 690 695 700 

219 Gly He Arg Glu Thr Gly Trp Lys Pro Ser Gly Lys Glu Lys Ser Lys 

220 705 710 715 720 

222 Glu Pro Arg Asp Pro Asp Gin Leu Tyr Ser Thr Leu Lys Ser He Leu 

223 725 730 735 

225 Gin Gin Val Lys Ser His Gin Ser Ala Trp Pro Phe Met Glu Pro Val 

226 740 745 750 

228 Lys Arg Thr Glu Ala Pro Gly Tyr Tyr Glu Val He Arg Phe Pro Met 

229 755 760 765 

231 Asp Leu Lys Thr Met Ser Glu Arg Leu Lys Asn Arg Tyr Tyr Val Ser 

232 770 775 780 

234 Lys Lys Leu Phe Met Ala Asp Leu Gin Arg Val Phe Thr Asn Cys Lys 

235 785 790 795 800 

237 Glu Tyr Asn Ala Ala Glu Ser Glu Tyr Tyr Lys Cys Ala Asn He Leu 

238 805 810 815 

24 0 Glu Lys Phe Phe Phe Ser Lys He Lys Glu Ala Gly Leu He Asp Lys 
241 820 825 830 

243 <210> SEQ ID NO: 3 
24*4 <211> LENGTH: 12 
245 <212> TYPE 




^^ach of these can be any ^ 



PRT 

246 <213> ORGANISM: /^artificial sequence 

248 <220> FEATURE: 

W — > 249 <221> NAME/KEY: X 

250 <222> LOCATION: (2).. (2) 

251 <223> OTHER INFORMATION: 
amino acid 

253 <220> FEATURE: 
W — > 254 <221> NAME/KEY: X . 
255 <222> LOCATION: { 4 ) . . ( 4;/ 



256 <223> OTHER INFORMATION: The X is/five to eigh t amino acj^s .^^Each ^of these can be any 
amino acid ' ' ' 




258 <220> FEATURE; 
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RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/784 , 553A 



DATE: 01/17/2003 
TIME: 11:03:34 



Input Set : A:\2459-l-003 Seqlist.txt 
Output Set: N:\CRF4\01172003\l784553A.raw 



W — > 



W — > 



W--> 259 

260 
261 
263 
264 
265 
266 
268 
269 
270 
271 
or His 

273 
W — > 274 
275 
276 
279 
W — > 280 
281 
282 
, 284 
W— > 286 
287 
289 
290 
291 
292 
294 
295 
296 
297 
300 
302 
303 
305 
306 
307 
308 
310 
W~> 311 
312 
313 
316 
W— > 318 
319 
321 
322 
323 
324 
326 



<221> NAME/KEY: X 

<222> LOCATION: (5) . . 
<223> OTHER INFORMATION: 
<220> FEATURE: 
<221> NAME/KEY: X 
<222>. LOCATION: (6) . . (6) 
<223> OTHER INFORMATION: 
<220> FEATURE: 
<221> NAME/KEY: X 
<222> LOCATION: (8).. (8) 
<223> OTHER INFORMATION: 



X is a single amino acid that is either Pro, Lys, or His. 



This X is any single amino acid. 



This X is a single amino acid that can be either Tyr, Phe, 




L^^^am 



is a single amino acid that can be Met, lie, or Val. 



W 



<220> FEATURE.: 
<221> NAME/KEY: X 

<222> LOCATION: (9) . . (9 
<223> OTHER INFORMATION 
<220> FEATURE: 
<221> NAME/KEY: X 
<222> LOCATION: (11).. (11) 
<223> OTHER INFORMATION: X 
<4 00> SEQUENCE^3 

Phe Xaa Pro Xaa Xaa Xaa Tyr Xaa Xaa Pro Xaa Asp 

1.5 10 
<210> SEQ ID NO: 4 
<211> LENGTH: 12 
<212> TYPE: PRT 
<213> ORGANIS 
<220> FEATURE 
<221> NAME/KEY: Xaa 
<222> LOCATION: (6) . . (6) 
<223> OTHER INFORMATION: 
<400> SEQUENCE: 4 

lie Ser Tyr Gly Arg Xaa Lys Arg Arg Gin Arg Arg 

15 10 
<210> SEQ ID NO: 5 
<211> LENGTH: 14 
<212> TYPE: PR' 

<213> ORGANIS^ artificial sequence 
<220> FEATURE: 
<221> NAME/KEY: X 
<222> LOCATION: (8).. (8) 
<223> OTHER INFORMATION: 
<400> SEQUENCE: 5 

Ala Arg Lys Ser Thr Gly Gly Xaa Ala Pro Arg Lys Gin Leu 



Each of these can be any amino acid. 




The X represents an acetyl-lysine 



9 



0^ 

The X represents an acetyl lysine. 



1 

<210> 
<211> 
<212> 
<213> 
<220> 



•10 



SEQ ID NO: 
LENGTH: 14 
TYPE: 
ORGAN I 

featur: 
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fhe ^pos of exrors shown exist throughout 
he sequence Listing. Please check subsequent 
sequences for similar errors. ^ 



The types of errors shown exist throughout 
the Sequence Listing. Please check subsequent 
sequences for similar errors. 



1/17/03 



6 



equence^ 



<210> SEQ ID NO 44 
<211> LENGTH: 20 
<212> TYPE; PRT^^f — ' ''^ 
<2^:g^0RGANISiyi^;|^^a^ s 

^'^iToAFEATURE : — " 

^223>/OTHER INFORMATION: 
<^W> SEQUENCE: 4 4 

Trp Pro Phe Met Glu Pro Val Lys Arg Thr Glu Ala Pro Gly Tyr Tyr 
15 10 15 

Glu Val He Arg 
20 



<210> 


SEQ ID NO 45 


























<211> 


LENGTH: 101 




























<212> 


TYPE: PRT 


























<213> 


ORGANISM: Human 


immunodeficiency virus type 


1 


c 






<400> 


SEQUENCE: 4 5 




























Met Glu Pro 


Val 


Asp 


Pro 


Arg 


Leu 


Glu 


Pro 


Trp 


Lys 


His 


Pro 


Gly 


Ser 




1 




5 










10 










15 






Gin Pro Lvs 


Thr 
20 


Ala 


Ser 


Asn 


Asn 


Cys 
25 


Tyr 


Cys 


Lys 


Arg 


Cys 
30 


Cys 


Leu 




His Cys Gin 


Val 


Cys 


Phe 


Thr 


Lys 


Lys 


Gly 


Leu 


Gly 


He 


Ser 


Tyr 


Gly 




35 










40 










45 










Arg Lys Lys 


Arg 


Arg 


Gin 


Arg 


Arg 


Arg 


Ala 


Pro 


Gin 


Asp 


Ser 


Lys 


Thr 




50 








55 










60 












His Gin Val 


Ser 


Leu 


Ser 


Lys 


Gin 


Pro 


Ala 


Ser 


Gin 


Pro 


Arg 


Gly 


Asp 




65 






70 










75 










80 




Pro Thr Gly 


Pro 


Lys 
85 


Glu 


Ser 


Lys 


Lys 


Lys 
90 


Val 


Glu 


Arg 


Glu 


Thr 
95 


Glu 




Thr Asp Pro 


Glu 
100 


Asp 

























<210> SEQ 
<211> LENGTH 
<212> TYPE: PRT 



ID NO 4 6 ^ . ^ Q 
TH: 7 M^P^ I 




<213> ORGANISM <^tificial sequence/ 



<220> FEATURE 
<221> NAME/KEY: X 
<222> LOCATION: (5) . . (5) 

<223> OTHER INFORMATIOl^ X is one to three amino acids 
any amino 
acid 

<4 00> SEQUENCE: 4 6 




ach amino acid can be 



Tyr Gly Arg Lys Xaa Arg Gin 
1 5 



<210> SEQ ID NO 47 
<211> LENGTH: 10 
<212> TYPE: PRT 

<2J^ ORGANISM(^^__artificial sequenc^- 
x^'^20^ FEATURE: 
jC22J ^ OTHER INFORMATION: 
<400> SEQUENCE: 47 

Ser Tyr Gly Arg Lys Lys Arg Arg Gin Arg 
15 10 

<210> SEQ ID NO 48 
<211> LENGTH: 10 
<212> TYPE: PRT 
<213> ORGANISM^^ artificial 
<220> FEATURE: 
<221> NAME/KEY: X 
<222> LOCATION: (2) . . (2) 
<223> OTHER INFORMATION 
amino aci 
d 

<220> FEATURE: 
<221> NAME/KEY: X 
<222> LOCATION: (4).. (4) 




Each of these can be any 



;ids .1 



<223> OTHER INFORMATION: X ife two to four amino acids^ Each of these can be any 

amino aci ^ ^ — ^ 

d \ 
<220> FEATURE: ' 

<221> NAME/KEY: X l/]f\jJlf^4/ 

<222> LOCATION: (6).. (6) yr^ — ^ 

<223> OTHER INFORMATION: X i^two to four amino acidsJ Each of these can be any 
amino aci 
d 

<220> FEATURE: 
<221> NAME/KEY: X 
<222> LOCATION: (8).. (8) 

<223> OTHER INFORMATION: X is one\ to three amino acids^ 
any amino ac 
id 

<220> FEATURE: 
<221> NAME/KEY: X 
<222> LOCATION: (10) . . (10) 
<223> OTHER INFORMATION: X is a single amino acid that is either lie. Leu, Met, 
or Val. 

<4 00> SEQUENCE: 4 8 

Phe Xaa Val Xaa Glu Xaa Tyr Xaa Val Xaa 
15 10 




Each of these can be 



<210> SEQ ID NO 49 
<211> LENGTH: 62 

<212> TYPE: PRT ^ 

^GANISM: Artificial sequenced 
ATURE: — ^ 




CHER INFORMATION: : 
<4 0Ty> SEQUENCE: 4 9 

Phe Met Glu Pro Val Lys Arg Thr Glu Ala Pro Gly Tyr Tyr Glu Val 
15 10 15 



RAW SEQUENCE LISTING ERROR SUMMARY 

PATENT APPLICATION: US/09/784 , 553A 



DATE: 01/17/2003 
TIME: 11:03:35 



Input Set : A:\2459-l-003 Seqlist.txt 
Output Set: N:\CRF4\01172003\l784553A.raw 



Please Note: 

Use of n and/or Xaa have been detected in the Sequence Listing. Please review the 
Sequence Listing to ensure that a corresponding explanation is presented in the <220> 
to <223> fields of each sequence which presents at least one n or Xaa. 



Seq#: 


3; 


Xaa 


Pos, 


^ .1 


Seq#: 


4; 


Xaa 


Pos. 




Seq#: 


5; 


Xaa 


Pos, 


r 

8' 


Seq#: 


6; 


Xaa 


Pes . 


Seq#: 


43, 


Xaa 


Pos 


: i,2 


Seq#: 


46, 


Xaa 


Pos 


. 5 


Seq#: 


48, 


' Xaa 


Pos 


Seq#: 


50, 


Xaa 


Pos 


. 5 


Seq#: 


51, 


• Xaa 


Pos 


. 5 


Seq#: 


52, 


' Xaa 


Pos 


. 5 


Seq#: 


53, 


• Xaa 


Pos 


. 5 


Seq#: 


54, 


• Xaa 


Pos 


. 5 


Seq#: 


55, 


• Xaa 


Pos 


. 5 


Seq#: 


56, 


' Xaa 


Pos 


. 5 


Seq#: 


57, 


Xaa 


Pos 


. 6 


Seq#: 


58, 


Xaa 


Pos 


. 7 


Seq#: 


59, 


Xaa 


Pos 


. 16 



,4/,p,^;8<9/l\/ 



Invalid Line Length: 

The rules require that a line not exceed 72 characters in length. This includes spaces. 

Seq#:34; Line(s) 1117 
Seq#:35; Line(s) 1145 

Invalid <213> Response: 

Use of "Artificial" only as "<213> Organism" response is incomplete, 

per 1.823(b) of New Sequence Rules. Valid response is Artificial Sequence. 

Seq#:48 

Use of <220> Feature (NEW RULES) : 

Sequence (s) ^are missing the <220> Feature and associated headings. 

Use of <220> to <223> is MANDATORY if <213> ORGANISM is "Artificial Sequence" 
or"Unknown", Please explain source of genetic material in <220> to <223> 
section (See "Federal Register," 6/01/98, Vol. 63, No. 104 ,pp . 29631-32) 
(Sec. 1.823 of new Rules) i 



Seq#: 44, 47, 49. 
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VERIFICATION SUMMARY DATE: 01/17/2003 

PATENT APPLICATION: US/09/784 , 553A TIME: 11:03:35 

Input Set : A:\2459-l-003 Seqlist.txt 
Output Set: N:\CRF4\01172003\l784553A.raw 

L:13 M:271 C: Current Filing Date differs. Replaced Current Filing Date 



L: 


249 


M: 


257 


W 


Feature 


value mi's 


-spelled or invalid. 


<221> 


Name/Key 


for 


SEQ 


ID# 


3 


L: 


254 


M: 


257 


W 


Feature 


value mis 


-spelled or invalid. 


<221> 


Name/Key 


for 


SEQ 


ID# 


3 


L: 


259 


M: 


257 


W 


Feature 


value mis 


-spelled or invalid. 


<221> 


Name /Key 


for 


SEQ 


ID# 


3 


L: 


264 


M: 


257 


W 


Feature 


value mis 


-spelled or invalid. 


<221> 


Name/Key 


for 


SEQ 


ID# 


3 


L: 


269 


M: 


257 


W 


Feature 


value mis 


-spelled or invalid. 


<221> 


Name/Key 


for 


SEQ 


ID# 


3 


L: 


274 


M: 


257 


W 


Feature 


value mis 


-spelled or invalid. 


<221> 


Name/Key 


for 


SEQ 


ID# 


3 


L: 


280 


M: 


257 


W 


Feature 


value mis 


-spelled or invalid. 


<221> 


Name/Key 


for 


SEQ 


ID# 


3 


L: 


286 


M: 


341 


W 


(4 6) "n 


" or "Xaa" 


used, for SEQ ID#:3 


after 


pos . : 0 










L; 


295 


M: 


257 


•w 


Feature 


value mis 


-spelled or invalid. 


<221> 


Name /Key 


for 


SEQ 


ID# 


4 


L: 


302 


M: 


341 


W 


(4 6) "n 


" or "Xaa" 


used, for SEQ ID#:4 


after 


pos . : 0 










L: 


311 


M: 


257 


W 


Feature 


value mis 


-spelled or invalid. 


<221> 


Name/Key 


for 


SEQ 


ID# 


5 


L: 


318 


M: 


341 


W 


(4 6) "n 


" or "Xaa" 


used, for SEQ ID#:5 


after 


pos . : 0 










L: 


327 


M: 


257 


W 


Feature 


value mis 


-spelled or invalid, 


<221> 


Name /Key 


for 


SEQ 


ID# 


6 


L: 


334 


M: 


341 


W 


(4 6) "n 


" or "Xaa" 


used, for SEQ ID# : 6 


after 


pos . : 0 ' 











L:1387 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:43 
L:1393 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:43 
L:1400 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:43 
L:1407 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:43 
L:1413 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:43 
L:1419 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:43 
L:1425 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:43 
L:1431 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:43 
L:1438 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:43 after pos.:0 
L:1446 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:44, <213> 
ORGANISM: artificial sequence 

L:1446 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:44, <213> 
ORGANISM: artificial sequence 

L:1446 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 44 , Line# : 144 6 
L:1488 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:46 
L:1496 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:46 after pos . : 0 
L:1504 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:47, <213> 
ORGANISM: artificial sequence 

L:1504 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:47, <213> 
ORGANISM: artificial sequence 

L:1504 .M:258 W; Mandatory Feature missing, <223> Blank for SEQ# : 47 , Line# : 1504 
L:1515 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:48 
L:1522 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:48 
L:1529 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:48 
L:1536 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:48 
L:1543 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:48 
L;1550 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:48 after pos . : 0 
L:1558 M:258 W: Mandatory Feature missing, <220> Tag not found for SEQ#:49, <213> 
ORGANISM: artificial sequence 

L:1558 M:258 W: Mandatory Feature missing, <223> Tag not found for SEQ#:49, <213> 
ORGANISM: artificial sequence 

L:1558 M:258 W: Mandatory Feature missing, <223> Blank for SEQ# : 4 9, Line# : 1558 
L:1578 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:50 
L:1585 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:50 after pos . : 0 

L:1594 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:51 
L:1601 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:51 after pos . : 0 

L:1610 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:52 
L:1617 M:341 W: (46) "n" or "Xaa" used, for SEQ ID#:52 after pos.:0 
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L:L626 M:257 W: Feature value mis-spelled or invalid, <221> Name/Key for SEQ ID#:53 
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Ik 



VERIFICATION SUMMARY 

PATENT APPLICATION: US/09/784 , 553A 



DATE: 01/17/2003 
TIME: 11:03:35 



L:1633 
L:1642 
L: 1649 
L:1658 
L:1665 
L: 1674 
L: 1681 
L: 1687 
L: 1690 
L: 1697 
L: 1706 
L:1713 
L:1722 
L:1729 



M:341 W 
M:257 W 
M:341 W 
M:257 W 
M:341 W 
M:257 W 
M:341 W 
M:220 C 
M:257 W 
M:341 W 
M:257 W 
M:341 W 
M:257 W 
M:341 W 



Input Set : A:\2459-l-003 Seqlist.txt 
Output Set: N:\CRF4\01172003\l784553A.raw 

(46) "n" or "Xaa" used, for SEQ ID#:53 after pos . : 0 
Feature value mis-spelled or invalid, <221> Name/Key 
(46) "n" or "Xaa" used, for SEQ ID#:54 after pos.:0 
Feature value mis-spelled or invalid, <221> Name/Key 
(46) "n" or "Xaa" used, for SEQ ID#:55 after pos.:0 
Feature value mis-spelled or invalid, <221> Name/Key 
(46) "n" or "Xaa" used, for SEQ ID#:56 after pos . : 0 
Keyword misspelled or invalid format, <213> ORGANISM 
Feature value mis-spelled or invalid, <221> Name/Key 
(46) "n" or "Xaa" used, for SEQ ID#:57 after pos.:0. 
Feature value mis-spelled or invalid, <221> Name/Key 
(46) "n" or "Xaa" used, for SEQ ID#:58 after pos.:0 
Feature value mis-spelled or invalid, <221> Name/Key 
(46) "n" or "Xaa" used, for SEQ ID#:59 after pos . : 0 



for SEQ ID#:54 

for SEQ ID#:55 

for SEQ ID#:56 

for SEQ ID#:57 
for SEQ ID#:57 

for SEQ ID#:58 

for SEQ ID#:59 
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